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ABSTRACT. We present a large-deviations/thermodynamic approach to the classic problem of 
percolation on the complete graph. Specifically, we determine the large-deviation rate function 
for the probability that the giant component occupies a fixed fraction of the graph while all other 
components are "small." One consequence is an immediate derivation of the "cavity" formula for 
the fraction of vertices in the giant component. As a by-product of our analysis we compute the 
large-deviation rate functions for the probability of the event that the random graph is connected, 
the event that it contains no cycles and the event that it contains only "small" components. 

1. Introduction 

For physical systems, mean-field theory often provides a qualitatively correct description of "re- 
alistic behavior." The corresponding analysis usually begins with the derivation of so called 
mean-field equations which are self-consistent relations involving the physical quantity of pri- 
mary interest and the various parameters of the model. This approach may be realized and, to 
some extent, justified mathematically by considering the model on the complete graph where 
each constituent interacts with all others. 

As an example, let us consider the Ising model on a complete graph K n of n vertices. Here we 
have a collection of ±l-valued random variables (<7j)™ =1 which are distributed according to the 
probability measure /^n({c}) = e~^ Hn ^ /Z n> p, where 

j n n 

H n (a) = aiUj - h^ai (1.1) 

71 . . . 

i,J = l 1=1 

and where (3, h are parameters. The relevant physical quantity is the empirical magnetization, 
m n (a) = n _1 Ya=i a i- m terms °f this quantity, H n (a) = —\n[m n (a)\ 2 — hm n (a) and so 

^ 1) = tanh[/3(m n (cT) + h)) + O(Vn). (1.2) 

This permits the following "cavity argument:" Supposing that m n tends, as n — ► oo, to a value m± 
in probability, we have that m* = lim n ^ 00 E n (a\ ) obeys 

= tanh[/?(m* + h)] . (1.3) 

This is the mean-field equation for the (empirical) magnetization. Of course, the concentration of 
the law of m n still needs to be justified; cf [20] for details. 
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In the context of percolation [22], the relevant mean-field model goes under the name the 
Erdos-Renyi Random Graph. Here each edge of K n is independently occupied with probabil- 
ity a / n , where < a < oo, and vacant with probability 1 — a / n . The relevant "physical" quantity 
is the giant-component density q±, i.e., the limiting fraction of the vertices that belong to the giant 
component of the graph. The corresponding mean-field equation, 

^ = l- e -^, (1.4) 

is also readily derived from heuristic "cavity" considerations. As is well known, q+ = is the 
only solution for a < a c = 1, while for a > a c there is another, strictly positive solution. This 
solution tends to zero as a [ a c ; hence we may speak of a continuous transition. 

While (1.3-1.4) are indeed straightforward to derive, matters at the level of mean-field equa- 
tions are not always satisfactory; the problem being the existence multiple solutions. As it turns 
out, for the percolation model (as well as the /c-core percolation) the proper choice is always the 
maximal solution, but prescriptions of this sort generically fail, e.g., for the Ising model (1.3) 
with h < and, as often as not, whenever there is a first-order transition. Thus, one is in need of 
an additional principle which determines which of the solutions is relevant. 

The existing mathematical approach to these difficulties — e.g., for percolation [21], see also [2, 
13, 24], or the /c-core [34] — is to work with sufficient precision until the mean-field conclusions 
are rigorously established. Another approach — which admits some prospects of extendability 
beyond the complete graph [5, 6] — is to supplement the picture by the introduction of the mean- 
field free-energy junction. For the Ising model, this is a function m ^ <&p^(m) such that 

fi n (m n (a) ps m) = e -"*/3>M+°H, „ _ oo, (1.5) 

i.e., m i — ^ ^(m) is the large-deviation rate function for the probability of observing the event 
{m n (a) ps m}. This spells the end of the story from the perspective of probability and/or theo- 
retical physics: One seeks the minimum of the free energy function, setting its derivative to zero 
yields the mean-field equations with the irrelevant solutions corresponding to the local extrema 
which are not absolute minima; see again [20]. 

The free-energy approach to mean-field problems has met with success in Ising systems and, to 
some extent, it has been applied to the Potts and random-cluster models [14, 31,18, 29]. However, 
no attempt seems to have been made to extend this technology to "purely geometrical" problems 
on the complete graph, specifically, ordinary percolation or /c-core percolation. The purpose of 
this note is to derive the large-deviation rate function for the event that the random graph contains 
a fraction q of vertices in "large" components. As we will see, the function has a unique minimum 
for all a which coincides with the "correct" solution of (1.4). We do not necessarily claim that 
the resultant justification of this equation is easier than which already exists in the literature. 
However, the picture presented here provides some additional insights into the model while the 
overall approach indeed admits the possibility of generalizations. 



2. Main results 

Consider the set of vertices Y = {1, . . . , n} and let (tOki)i<k<i<n t> e a collection of i.i.d. random 

variables taking value one with probability p and zero with probability 1 — p. Let £ = 

be the (random) set {(k,l): 1 < k < I < n, ujj-i = !}• In accord with the standard notation, 
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Figure 1 . The graph of the free energy function g i— » &(q, a) for four distinct values of a. 
For a < 1, the function is minimized by g = 0, while for a > 1 the unique minimum occurs 
at q > 0. In any case, the minimizer is the maximal positive solution for g+ from (1.4). The 
dashed portion of the graphs for a = 1.6 and 2.4 marks the part where the background contri- 
bution, ^(a(l — gj), to a) in (2.4) is strictly positive. This rules out the zero solution to 
(1.4) for all a > 1. 



cf [2, 13], we will use &(n,p) to denote the undirected graph with vertices f and edges S. Of 
particular interest are the cases where p decays to zero proportionally to 1 / n . Since these are the 
only problems we will consider, let us set, for once and all, p = a / n for some fixed a G [0, oo). 
We will denote the requisite probability measure by P n , a - 

In order to state our main theorems, we need to introduce some notation. First, consider the 
standard entropy function 



and let 



S(g) = Qlogg+ (1 - q) log(l - q) 
7ri(a) = l-e~ Q . 



In addition, consider the function 



*(a) = log a - 



a — 



a 



AO 



(2.1) 



(2.2) 



(2.3) 
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and note that \I/(a) < if and only if a > 1. Finally, let us also define 

®(g,a) = S(g) - glogTT^ag) 

-(l- e )log[l-iri(a e )] -(l- e )*(a(l- e )). (2-4) 

Then we have: 

Theorem 2.1 Consider Sf(n, a / n ) and let be the set of vertices that are in connected compo- 
nents of size larger than r. Then for every q G [0, 1], 

lim lim P n , a {\r en \ = [ 6 n\) 1/n = e"*^. (2.5) 

An inspection of Lemma 6.2 reveals that, conditional on {|^ n | = L^ n J}' w i tn e > 0, there 
will be only one "large" component with probability tending to one as n — > oo. 

Fig. 1 shows the graph of <I> for various values of a which is archetypal of free-energy functions 
in complete graph setting. The figure indicates a unique global minimum; direct, albeit arduous 
differentiation of (2.4) yields the fact that all local extrema satisfy the mean-field equation (1.4). 
The externum at q = is ruled out for a > 1 by noting that, under these conditions, the last term 
in (2.4) is strictly positive. 

The corresponding conclusion may also be extracted from the following probabilistic argu- 
ment: Let m = Yqu\ and note that e~ nS ^ is then the exponential growth-rate of (^). This 
allows us to write 

e -n*( ft a) = e o(n) ^ ^(ag)]™^ - TTl (ag)] " . (2.6) 

Neglecting the ^-term (which provides a lower bound on <J>), one sees a quantity reminiscent 
of binomial distribution. Well known results on the latter inform us that the right-hand side is 
exponentially small unless 

-K-iiptQ) w — , (2.7) 
n 

i.e., unless g satisfies the mean-field equation (1.4). If * is set to zero, there are degenerate 
minima for a > 1; however, the ^-function will lift the degeneracy and, in fact, create a local 
maximum at g = once a > 1. Meanwhile, in the region of the maximal solution, ^ has 
vanished and the above mentioned approximation is exact. 

Remarks 2.2 (1) A closely-related, but different problem to the one treated above has previously 
been studied using large-deviation techniques. Indeed, in [32], O'Connell derived the large- 
deviation rate function for the event that the largest connected component is of size about nn. 
Note, however, that this does not restrict the total volume occupied by these component. For k 
close to g+ from (1.4) — explicitly, as long as the complement of the large component has ef- 
fective a less than 1 — O'Connell's rate function coincides with ours. But once k is sufficiently 
small, his conditioning will lead to the creation of several large components whose total volume 
is such that their complement is effectively subcritical. Consequently, O'Connell never needs to 
address the central issue of our proof; namely, the decay rate of the probability that supercriti- 
cal percolation has no giant components. (This is what gives rise to the term \& in (2.4) and the 
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dashed portion of the graph in Fig. 1 .) In fact, his rate function is basically a concatenation of 
many scaled copies of the undashed portion of the graph in Fig. 1 . 

(2) While the ^-term in (2.4) has a non-trivial effect on the large-deviation questions studied 
here, it does not play any role for events whose probability is of order unity (or is subexponential 
in n). This is because ^ "kicks in" only for g away from the minimizing value. This is not 
the case for the fc-core where the corresponding large-deviation analysis [8] suggests that the 
analogous term "kicks in" right at the minimizer and may even affect the fluctuation scales. One 
way to bring <!> out of the "realm of exponentially-improbable" for percolation would be to give 
each configuration a weight suppressing large components. However, we will not pursue these 
matters in the present note. 

(3) Our control of the rate function is not sharp enough to provide a detailed description of the 
critical region, i.e., the situations when a = 1 + 0(n -1 / 3 ). The corresponding analysis of the 
scaling phenomena inside the "critical window" has been performed in [12, 28, 30, 33, 15]. On 
the other hand, for a > 1 one should be able to sharpen the control of the rate function near its 
minimum to derive a CLT for the fluctuations of the size of the giant component. 

Several ingredients enter our proof of Theorem 2. 1 which are of independent interest. We state 
these as separate theorems. The first one concerns the exponential decay rate for the probability 
that the random-graph is (completely) connected: 

Theorem 2.3 Let K denote the event that %?(n, a / n ) is connected. Then 

P n , a (K) = (1 - e- a ) n e°^ n \ n ^ oo, (2.8) 
where 0(log n) is bounded by a constant times log n uniformly on compact sets ofaG [0, oo). 

We remark that Theorem 2.3 holds with e°( logn ) replaced by C(a) + o(l), see [4] for a proof. 
However, the requisite steps seem far in excess of the derivation in Sect. 3. Furthermore, various 
pieces of Theorem 2.3 have been discovered, apparently multiple times, in [36, 26, 37, 27, 3]; cf 
also the discussion following Lemma 3.3. 

Next we present a result concerning the event that W(n, a / n ) contains no cycles. Such problems 
have been extensively studied under the conditions where this probability is 0(1), see e.g. [13]. 
Our theorem concerns the large-deviation properties of this event: 

Theorem 2.4 Let L be the event that £f(n, a / n ) contains no cycles. Then 

lim P na (L)V» = ( aeXp ("f + E) > > L (2 . 9) 

rc->oo ' I i ) otherwise. 

Strictly speaking, this result is not needed for the proof of our main theorem; it is actually 
used to derive the exponential decay for the probability of the event that W(n, a / n ) contains only 
"small" components. Surprisingly, the decay rates for these two events are exactly the same: 

Theorem 2.5 Let L be the event that Sf(n, a / n ) contains no cycles and let B r be the event that 
there are no components larger than r. Then 

lim liminf P n , a {B r ) l l n = lim lim sup P na (B en ) 1 l n = lim P noi (V) x l n . (2.10) 
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Update: In the present paper we prove Theorem 2.4 using enumeration and generating-function 
techniques. Recently, a probabilistic approach has been developed by which we obtain an expan- 
sion of P n , a (L n ) to quantities of order unity. One advantage of the new approach is that it also 
permits the analysis of the conditional measure P n , a (-\L n ); see [9]. 

To finish the discussion of our results, let us give some reason for the word "thermodynamic" 
in the title. The motivation comes from an analogy with droplet formation in systems at phase 
transition. Such situations have been studied extensively in the context of percolation [1, 16] and 
Ising (and Potts) model [19, 23, 7, 10, 17] under the banner of "Wulff construction," see [11] for 
a review of these matters. 

One of the principal questions underlying Wulff construction is as follows: Compute the 
probability — and the characteristics of typical configurations carrying the event — that a given 
fraction of the system is in one thermodynamic state (e.g., liquid) while the rest is in another 
state (e.g., gas). It turns out that the typical configurations are such that the two phases separate; 
a droplet of one phase "floats" in the other phase. The requisite probability is then given by a 
large-deviation expression whose rate function is composed of three parts: the "surface" energy 
and entropy of the droplet, the rate function for the probability that the droplet is all in one phase, 
and the rate function for the probability that the complement of the droplet is in the other phase. 

In the case under study, the droplet is exactly the giant component and its weight is just the 
probability that all vertices in the droplet are connected to each other. The "surface" energy is 
(the log of) the probability that no vertex inside is connected to no vertex outside; the entropy 
is (the log of) the number of ways to choose the corresponding number of sites. The weight of 
the phase outside simply amounts to the probability that all remaining components are of sub- 
macroscopic scale. When the leading-order exponential decay rate of all of these contributions is 
extracted using Theorems 2.3-2.5, we get a quantity that only depends on the fraction of vertices 
taken by the droplet. The resulting expression is the one on the right-hand side of (2.6). 

3. Everybody connected 

The goal of this section is to prove Theorem 2.3. Our proof is based on showing that the proba- 
bility in (2.8) is exactly the same probability in a related, directed graph problem. 

For a collection of vertices Y n = {1, . . . , n} and a set of edge probabilities (pki)i<k<i<n, let <$ 
be the inhomogeneous undirected random graph over Y n . Similarly, let denote the inhomoge- 
neous directed complete random graph with the restriction that the two possible (directed) edges 
between k and I occur independently, each with probability pk\. To keep our notation distinct 
from the special case pki = a / n treated throughout this paper, we will write P instead of P n , a - 

Definition 3.1 A labelled directed graph & = (Y, S) is said to be grounded at vertex v £f if 
for every w G Y there exists a (directed) path from w tov in S. 

The identification of the two problems is now stated as follows: 

Lemma 3.2 Let K be the event that is connected and let G be the event that is grounded 
at vertex "1." Then P{K) = P(G). 
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Proof. We use induction on the total number of edges incident with vertex "re." Indeed, if p/, n = 
for all k = 1, . . . , n — 1, then P{K) = P(G) because both probabilities are zero. Now let us 
suppose that P{K) = P{G) when pi n = for all I = k, . . . , re — 1 and let us prove that it 
also for pfcn > 0. It clearly suffices to show that the partial derivatives of P(K) and P(G) with 
respect to pk n are equal for all pk n € [0, 1]. 

Notice first that both K and G are increasing events. Invoking Russo's formula, see [35] or [22, 
Theorem 2.25], we obtain 

( P(G) = P((n, fc) is pivotal for G) , (3.1) 



where the event {(re, fc) is pivotal for G} means that if (re, fc) is occupied, the event G occurs and 
if not, it does not. (Note that (re, k) denotes the edge going from "re" to "A;.") The conditions 
under which this event occurs are straightforward: The set "V n = {1, . . . , re} splits into two 
disjoint components, one rooted at "1" and the other at "n," such that no vertex in the component 
associated with vertex "n" has an oriented edge to the other component and k has an oriented 
path to 1. Similarly, we have 

Q 

P{K) = P((n, k) is pivotal for K). (3.2) 



dpkn 

Here {(re, k) is pivotal for K} simply means that, if the edge (re, k) is absent, y n consist of two 
connected components, one containing "1" and the other containing "re." 

To see the equality of partial derivatives, we split both "pivotal" events according to the com- 
ponent containing the vertex "re." If W is a set of vertices such that n G W and 1 W , let ^n^/ 
and be the restrictions of to 5^, and ~¥ n \W , respectively. Similarly, let and 
be the corresponding "components" of the oriented graph. Let K n ^ and K\ yr be the events 
that ^ n ^/ and ^\^/ are connected and let G n yr and be the events that is grounded 
at "re" and that ^i,^ is grounded at "1," respectively. Since these pairs of events are independent, 
we have 

P((n, k) is pivotal for G) = £ P(G hW )P(G n ^)P(C w )\ pkj _ , (3.3) 

1„" 



where is the event that no vertex in 5^ has a (directed) edge to Y n \ W. But the induction 
assumption tells us that P{G\^) = P(K\^) and P{G n ,w) = P{Kn,w)> an d the symmetry 
of edge probabilities for the directed graph tells us that P{Cy^) is the probability that Sf n) ^, 
and are not connected by an edge in Sf. Substituting these into (3.3), we get the right-hand 
side of (3.2). This completes the induction step. □ 

From now on, let K and G pertain to the specific random graphs £f (n, a / n ) and £f (n, Q / n ). We 
begin with upper and lower bounds on P n ^ a (K): 

Lemma 3.3 P n>a (K) < (l - (1 - "/n)™ -1 )"" 1 . 
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Proof. Let E be the event — concerning the graph W(n, a / n ) — that every vertex except number "1" 
has at least one outgoing edge. Then G C E and so 

PnAG) < PnA E ) = (l - (1 - a /n) n_1 ) n_1 • (3-4) 

Invoking Lemma 3.2, this proves the desired upper bound. □ 

We remark that the upper bound in Lemma 3.3 has been discovered (and rediscovered) several 
times in the past. It seems to have appeared in [36] for the first time and later in [26, 37] and 
also [25]. A generalization to arbitrary connected graphs has been achieved in [27]. 

Lemma 3.4 P n , a (K) > (l - (1 - "/n)" -1 )" -1 \. 

Proof. Consider the following events for directed random graph (n, a / n ): Let E be the event 
that every vertex, except vertex number "1," has at least one outgoing edge and let F be the event 
every such vertex has exactly one outgoing edge. Since G C E, we have 

PnAG) = PnAE)PnAG\E). (3.5) 

We claim that 

P n AG\E) > P n AG\F). (3.6) 

Indeed, let us pick an outgoing edge for each vertex different from "1," uniformly out of all edges 
going out of that vertex, and let us color these edges red. Let G' be the event that G occurs using 
only the red edges. The distribution of red edges conditional on E is the same as conditional 
on F. Hence P n>a (G\E) > P nta (G'\E) = P n AG'\F). But, on F, every available edge is red 
and so P n AG'\F) = P U:a (G\F). Combining these inequalities, (3.6) is proved. 

The number of configurations that Sf(n, a / n ) can take on F is exactly [n — l) n_1 . On the other 
hand, the number of configurations which result in Sf(n, a / n ) being grounded is a n = n n ~ 2 — the 
number of labelled trees with n vertices. Hence 

PnAGW) > - ( r— T > -• (3-7) 

Using that P„, a (£) = (1 - (1 - "/n)™" 1 )"" 1 the desired bound follows. □ 
Proof of Theorem 2.3. The claim is proved by noting 

/ . \ n— 1 

(l - (1 - «/ n ) n - 1 ) 

n— >oo 

and using the results of Lemmas 3.3 and 3.4. □ 



lim - (1 _ e - a) n-i = ex P ( (! " - ) (3.8) 



4. Only trees 



Here we will assemble the necessary ingredients for the proof of Theorem 2.4. The proof is based 
on somewhat detailed combinatorial estimates and arguments using generating functions. 
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Recall that L denotes the event that W(n, a / n ) contains no cycles and that B r denotes the event 
that all components of W(n, a / n ) have no more than r vertices. We begin by a combinatorial 
representation of the probability P n , a {L n B r ): Let at denote the number of labeled trees on I 
vertices. Then 



mt=0W>r 



fc=l 

(4.1) 

where we set k = J2e m h applied the constraint J2e = n an( ^ l et Qn,k,r denote the sum 

e n(f)""^ («> 

J2m e e=n £>1 
J2 mt=k 
m e =0V£>r 

We begin by isolating the large-n, k behavior of this quantity: 
Proposition 4.1 Consider the polynomial 

Ms) = t S -^ (4-3) 
1 F (s) k 

Q n , k ,r<-r;mf^y-. (4.4) 
k\ s>o s n 

Moreover, for each ry > 0, ?/iere is no < oo and a sequence (c r ) r >i of positive numbers for which 

c r 1 F r (s) fc 
Q n , fe , r >^^inf^i- ( 4 - 5 ) 
vn fc! s>o s 11 



Then for all n,k,r > 1, 



holds for all n > no, all k > 1 and all r > 2 such that k < (1 — n)n a«<i r/c > n(l + n). 
Proof of upper bound. Let us consider the generating function 

oo n 

Q r (s, z) = 1 + Qn,k, r z k s n = exp{zF r (s)}, (4.6) 

71=1 k=l 

where we used Fubini-Tonelli to derive the second equality. Since F r is a polynomial, the Cauchy 
integral formula yields 

where all integrals are over a circle of positive radius centered at the origin of C. Since all 
coefficients of F r are non-negative, ^ \F r (se l9 )\ for s > is maximized at 6 = 0. Bounding 
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the integrand by its value at 9 = 0, the integral yields a factor 2ir; optimizing over s > then 
gives the upper bound in (4.4). □ 

Proof of lower bound. As is common in Tauberian arguments, the lower bound will require 
somewhat more effort. First let us note that under the conditions k < (1 — rj)n and rk > n(l + rj) 
the function s i— > F r (s) k /s n , for s > 0, blows up both at and oo. Its minimum is thus 
achieved at an interior point; for the rest of this proof we will fix s to a minimizer of this function. 
Since \F r (se ld )\ < F r (s) for all 6 G (— n, it] \ {0}, the part of the integral in (4.7) corresponding 
to \9\ > e is exponentially small (in n) compared to the infimum in (4.5). We thus need to show 
the lower bound only for the portion of the integral over 6 with \9\ < e, for some fixed e > 0. 

Since F r has positive coefficients, F r / in the (complex) e-neighborhood of s. This allows 
us to define the function 

F fse i6 V 

g(9) = log r[S w > , \9\<e, (4.8) 

where g plays the role of k / n . The function g is analytic in an 0(e)-neighborhood of the origin. 
The choice of s implies that </(0) = which is equivalent to 

F r (s) Q 

For the second derivative we get g"(0) = —gVar(X), where X is the random variable with law 

In particular, since our restrictions on g between ^(1 + 77) and 1 — 77 imply that s is bounded away 
from zero, this law is non-degenrate and so g"(0) < 0. 

The analyticity of 9 g{9) for 9 = 0(e) implies that g'" is bounded in this neighborhood, 
and so by Taylor's theorem we have 

g(9)=g(0)-A9 2 + O(9 3 ), (4.11) 

where A = A(r, g) is positive uniformly in the allowed range of g's and 0(9 3 ) is a quantity 
bounded by \9\ 3 times a constant depending only on r, e and 77. (In particular, we may assume 
that 0(9 3 ) is dominated by \A9 2 for \9\ < e.) 

We will split the integral over 9 G [— e, e] into two more parts. Let S > and note that ng(0) 
is the logarithm of the infimum in (4.5). Then for 9 with <5n -1 / 3 < \9\ < e we have 

nReg(9) < ng(0) - ^A5 2 n l/3 (4.12) 

which shows that even this portion of the integral brings a contribution that is negligible compared 
to the right-hand side of (4.5). But for \9\ < dn^ 1 / 3 we have nO(9 3 ) = 0(8) and so for <5 <S 1, 
the Taylor remainder will always have imaginary part between, say, — V4 and V4. This means that 

Re/ e »#)dfl>>(°) / t- nM d9 > -^- e n 9(°) (4.13) 

for some constant c > which may depend on r and 77 but not on g and n. Combined with the 
previous estimates, this proves the lower bound (4.5). □ 
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In light of the above lemma, the k-th term in the sum on the extreme right of (4.1) becomes 



a- k n k c- a -Q nAr = e°W inf exp{ne r (s, */„)}, (4.14) 
where 

@ r (s, g) = -gloga - glogg + g + glogF r (s) - logs. (4.15) 
Here we should interpret (4.14) as an upper bound for r = n and a lower bound for fixed r. It is 
clear that, regardless of r, the sum is dominated by k = [gn\ for which g inf s> o & r (s, g) is 
maximal. Such values are characterized as follows: 

Lemma 4.2 Let a > and r > 2. Then there is a unique (s r , g r ) £ [0, oo] x [Vr, l]/or which 

Q r (s r ,6r)= sup inf Q r (s,g). (4.16) 

l/r<£,<l s>0 

Moreover, we always have s r G (0, oo) and g r G (Vr, 1) and, furthermore, 

f 1 + a / 2 - log a, ifa<l, 
lim & r (s r ,g r ) = < (4.17) 
[1 + ^, ifa>l. 

Proof. We begin by ruling out the "boundary values" of s and g. First, if g = l / r , then the infimum 
over s is actually achieved by s = oo. In that case F r (s) = oo and the (one-sided) derivative 
with respect to g is infinite, i.e., g = l / r is a strict local minimum of g ^ inf s> o B r (s, g). 
Similarly, for g = 1 the infimum over s > is achieved at s = but then the ^-derivative of 
g i ^ inf s> o 9 r (s, g) is negative infinity, i.e., also g = 1 is a strict local minimum. It follows that 
any (s r , g r ) satisfying (4.16) necessarily lies in (0, oo) x (Vr, 1)- 

Setting the partial derivatives with respect to s and g to zero shows that any minimizing pair is 
the solution of the equations 

F r (s)=ag and sF^s) = a. (4.18) 

In light of monotonicity of s i-> sF^(s), the solution is actually unique. To figure out the asymp- 
totic as r — ► oo, we note that for s < Ve, 

r £ 

^=i v ; ' 

where is the unique number in [0, Ve] such that We~ w = s. (Incidentally, W is closely 
related to the survival probability of the Galton-Watson branching process with Poisson offspring 
distribution.) If s > Ve, then sF^(s) — > oo as r — > oo. Using the relation between sF^(s) and a, 
we thus get 

[ ae _a , if a < 1, 

s r — <^ " (4.20) 

r ^°° (Ve, if a > 1. 

Integrating the derivative of F r now shows that F r (s r ) — > a(l — a /2) for a < 1. Using that Fj.(s) 
is bounded for s < s r , we also find that F r (s r ) — > V2 for a > 1. This yields 

l- a / 2 , ifa<l, 

(4.21) 

if a > 1. 
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Noting that Q(s r , g r ) = g r — log s r we now get (4.17). □ 

Proof of Theorem 2.4. By the fact that the supremum over g in (4.16) is achieved at an interior 
point, we can control the difference between the maximizing k / n and its continuous counterpart g. 
Thence 

-J e" an/i exp{ne r (s r ,£ r )}, (4.22) 

where 

< Qn,r < n (4.23) 

V n 

for some positive constants c r which may depend on r and a. Since B n contains every realization 
of Sf(n, a / n ), taking r = n and applying Lemma 4.2 directly shows that P a ^ n (L) < e n *( Q? )+°(™)_ 
To get a corresponding lower bound, we fix r > 2 and apply P a ^ n {L) > P a ^ n (LnB r ). Taking 1 / n - 
th power and letting n — ► oo then yields 

lim P n JL n B r ) l l n = ae - l - a / 2 +®r{s r , er )^ (4 24) 

n— >oo ' 

As we have just checked, the right-hand side tends to e*( a ) as r — > oo. □ 
Corollary 4.3 W? Ziave 

lim lim P Q , n ( J B r nL) 1 / n = lim Pa^L) 1 ^ 1 . (4.25) 

r— >oon— >oo n— >oo 

Proo/ This summarizes the last step of the previous proof. □ 



5. NO BIG = NO CYCLES 

Here we will prove that absence of large component has a comparable cost to absence of cycles, 
at least on an exponential scale. To achieve this goal, apart from Corollary 4.3, we will need the 
following upper bound: 

Lemma 5.1 Let B r be the event that &(n, a / n ) has no components larger than r and let L be 
the event that all connected components of@(n, a / n ) are trees. Then for all r > 1, 

/ a \ —hrn 

PnA B r) < PnA L )( l ~ -) ■ (5-1) 



Proof. Let C be the restriction of (n, a / n ) to a set S C {1, ... , n}. Let T be a tree on S. Then 



Hence 



P n , a (C is connected) < P n<a (C D T) < \1 J P n , a (C is a tree). (5.3) 

Now, if L r is the event that no component of W(n, a / n ) of size larger than r has cycles, then 
B r C L r and so P n ^ a (B r ) < P n Q! (L r ). Let {Sj} be a partition of {1, ... , n} and let P n ,a({Sj}) 
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denote the probability that {Sj} are the connected components of Sf(n, a / n ). Then 

Pn,a( L r) = £ P nA{Sj})PnA L r\{Sj}), (5.4) 

where P n ,a(L r \{Sj}) is the conditional probability of L r given that {S'j} are the connected com- 
ponents of (n, a / ra ). 

Letting Cj represent the restriction of (n, a / n ) to Sj, the bound (5.3) tells us that 

P n ,a( L r\{Sj}) = \ I P n ,a( C j is a tree|Cj is connected) 

J- \Sj\>r 



Pn,a(Cj is a treejCj is connected) | | ^1 J 



j: |^|<r 



(5.5) 

71/ 



Using that \Sj\ < r for every Sj contributing to the second product and applying that the sum 
of | Sj | over the components with \Sj\ < r gives at most n, we then get 



1--J . (5.6) 

Plugging this back in (5.4), the desired bound follows. □ 
Proof of Theorem 2.5. By Lemma 5.1 we have 

limsupiV^) 1 /™ < e e / 2 lim P n , a (L) l/n - (5.7) 

n — >oo n >0 ° 

On the other hand, the inclusion B r D B r n L and Corollary 4.3 yield 

liminfP n , a (5 r ) 1 / n ^liminfP^^nL) 1 /™ _> l im P (L). (5.8) 

n^oo n^oo r— »oo ra^oo 

Since P n ^ a (B r ) < P„ ia (P en ) eventually for any fixed r > 1 and e > 0, all limiting quantities are 
equal provided we take r — > oo and/or e J. after n — > oo. □ 

6. Proof of main result 

Before we start proving our main result, Theorem 2. 1, we need to ensure that if a large component 
is present in the graph, then it is unique. The statement we need is as follows: 

Lemma 6.1 Let K e ^ be the event that Sf(n, a / n ) is either connected or has exactly two con- 
nected components, each of which is of size at least en, and recall that K is the event that %?(n, a / n ) 
is connected. Then for all Qo > and eo > there exists c\ = ci(ao,eo) < 1 such that for 
all e > €q and all a < ao, 

limsupP Qin (K c |iv' ei2 ) 1 / n < d. (6.1) 

n— >oo 

Proof. It clearly suffices to show that the ratio of P a , n (-^e,2 \ K) and P a ^ n {K) decays to zero 
exponentially with n, with a rate that is uniformly bounded in e > eo and a < qq- In light of 
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Theorem 2.3 and the fact that K e 2 only admits components that grow linearly with n, we have 

PaAKeAK) o[n) ^ (n\ ^{a k / n ) k ^Ml- k /n)) n ~ k axfcCn-fc) 

P«,n(*) " ^ V 7r!(a)» ^ J ' ^ 

' en<k<n—en 

where o{n)/n tends to zero uniformly in e > eo and a < «o- Writing £> for k / n , the expression 
under the sum can be bounded by e n l~(e)-~(°)] , where 

E(q) = -S(q) + glog^iag) + (1 - g) log7Ti(a(l - g)) - ag(l - g). (6.3) 

Since g is restricted to the interval [e, 1 — e], the right-hand side of (6.2) will be exponentially 
small if we can show E(g) < 2(0) for all g G (0, 1) and all a. 

As is easy to check, the function g h-> E(g) is symmetric about the point g = V2. Hence, if 
we can prove that it is strictly convex throughout [0, 1], then it is maximized at the endpoints. 
Introducing the function 

G(r/)=r/ l og ^M (6 .4) 
77 

we have 

a~(v/ a ) = G(rj) + G(a - 77) + r?(a - r/). (6.5) 
In order to prove strict convexity of 2, it thus suffices to show that G"(rj) + 1 > for all 77 > 0. 
Introducing yet another abbreviation q(r]) = 77/(1 — e - ''), a tedious but straightforward differen- 
tiation yields 

G"( V ) + l = kq'-q)(qe-'l-l). (6.6) 

A direct evaluation now shows that both q' — q and qe^ 11 — 1 are negative once 77 > 0. □ 
We will use the above lemma via the following simple conclusion: 

Lemma 6.2 Let N r denote the number of connected components of&(n, a / n ) of size at least r 
and let % be the set of vertices contained in these components. Then for all a > and q > e > 
there exists c = c(e, g, a) > such that 

Pa,n{\%n\ = lQn\ k N en = 1) > (1 -e- Cn )P a , n {\%n\ = LH)- (6-7) 

Proof. Clearly, (6.7) will follow if we can prove that 

P a ,n(\%n\ = [Qn\ & N en > 1) < e- cn P a , n (\% n \ = [gn\) . (6.8) 

Let y(x) denote the connected component of W(n, a / n ) containing the vertex x and let x ^ y 
denote the event that x, y G % n but y(x) n V(y) = 0- Then (6.8) will be proved once we show 

P a ,n(\%n\ = [<m\ & x y) < €~ 2cn P a , a (\% n \ = [gn\). (6.9) 

(Indeed, the sum over x, y adds only a multiplicative factor of n 2 on the right-hand side.) By 
conditioning on the set Y en and the set y(x) U "V{y), this inequality will in turn follow from 

P a ,n{x ^yhtix) U y{y) = r)< e- 2cn P a , n {r(x) U y{y) = r). (6.10) 

Indeed, let us multiply both sides by the probability that Y is disconnected from the rest of the 
graph and that all components disjoint from V of size at least en take the total volume [gn\ — 
The sum over all admissible Y reduces (6.10) to (6.9). 
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We will deduce (6.10) from Lemma 6.1. Recall that K is the event that the graph is connected 
and K £: 2 is the event that it has at most two components, each of which is of size at least en. 
We will now use these events for the restriction of &(n, a / n ) to "V: Let m = a = oJ^ 
and e = e— . Then we have 

m 

{x ^ y} n {r(x) U Y(y) = V] C K c n Kz i2 , (6.11) 

while for the event on the right-hand side of (6.10) we simply get 

{f(x)U%)=f} = % (6.12) 

By Lemma 6.1 and the fact that a < a and e > e, 

Pa, m (K c \K l2 ) < e- cim , (6.13) 

once n is sufficiently large. But m > 2en and so (6.10) holds with c = eci. □ 

Now we have finally amassed all ingredients needed for the proof of our main result. 

Proof of Theorem 2.1. The case g = is quickly reduced to Theorems 2.4-2.5 while g = 1 boils 
down to Theorem 2.3. Thus, we are down to the cases g £ (0, 1). Let e G (0, £?). By Lemma 6.2, 
we can focus on the situations with N en = 1. To make our notation simple, let us assume that gn 
is an integer. Then we have 

P a ,n(\Ven\ = Qn&N en = l) = ( J P e n,a e (^)Pn- e n,a(l- e ) (B m ) [1 ~ -) ■ (6.14) 

The terms on the right-hand side represent the following: the number of ways to choose the 
unique component of size gn, the probability that this component is connected, the probability 
that the complement contains no component of size larger than en and, finally, the probability that 
the two parts of the graph do not have any edge between them. Invoking Stirling's formula to deal 
with the binomial term, and plugging explicit expressions for P sn ,ae{K) and P n - en ,a{i-()){Ben) 
from Theorems 2.3-2.5, the result reduces to a simple calculation. □ 
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